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1879.] On the Law of the Geometric Mean . 

of common occurrence. The one is that of ordinary increase, as 
exemplified by the growth of population, where an already large 
nation tends to become larger than a small one under similar cir¬ 
cumstances, or when a capital employed in a business increases in 
proportion to its size. The other category is that of surrounding 
influences, or “ milieux ” as they are often called, such as a period of 
plenty in which a larger field or a larger business yields a greater 
excess over its mean yield than a smaller one. Most of the causes 
of those differences with which sociology are concerned, and which 
are not purely vital phenomena, such as those already discussed, may 
be classified under one or other of these two categories, or under such 
as are in principle almost the same. In short, sociological phenomena, 
like vital phenomena are, as a general rule, subject to the condition of 
the geometric mean. 

The ordinary law of Frequency of Error, based on the arithmetic 
mean, corresponds, no doubt, sufficiently well with the observed facts 
of vital and social phenomena, to be very serviceable to statisticians, 
but it is far from satisfying their wants, and it may lead to absurdity 
when applied to wide deviations. It asserts that deviations in excess 
must be balanced by deviations of equal magnitude in deficiency 
therefore, if the former be greater than the mean itself, the latter 
must be less than zero, that is, must be negative. This is an impossi¬ 
bility in many cases, to which the law is nevertheless applied by statis¬ 
ticians with no small success, so long as they are content to confine its 
application within a narrow range of deviation. Thus, in respect of 
stature, the law is very correct in respect to ordinary measurements, 
although it asserts that the existence of giants, whose height is more 
than double the mean height of their race, implies the possibility of 
the existence of dwarfs, whose stature is less than nothing at all. 

It is, therefore, an object not only of theoretical interest but of 
practical use, to thoroughly investigate a Law of Error, based on the 
geometric mean, even though some of the expected results may 
perhaps be apparent at first sight. With this view I placed the fore¬ 
going remarks in Mr. Donald McAlister’s hands, who contributes the 
following memoir. 


XIII. “ The Law of the Geometric Mean.” By Donald 
McAlister, B.A., B.Sc., Fellow of St. John’s College, 
Cambridge. Communicated by Francis Galton, F.R.S. 
Received October 21, 1879. 

Suppose we have before us a large number of measurements. They 
may either be all approximations to the true value of a single unknown 
quantity, or may refer to the several members of a large class. The 
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measurements will disagree among tliemselves, but on arranging them 
in order of size they show a tendency to cluster round some medium 
value. We are naturally inclined to infer that the true value of 
the unknown, or typical member of the class, is not far from this 
value. How to define and determine the appropriate medium in 
various classes of measurement becomes thus a natural object of 
inquiry. On examination we find that there is no strict and final 
criterion applicable to all cases. We have to start with an empirical 
assumption, more or less justifiable on general grounds, but not 
capable of rigid proof. In the ordinary Theory of Errors, which deals 
primarily with discrepant observations , the assumption made is re¬ 
ducible to this :—that, on the whole, the best medium we can take is 
the Arithmetic Mean of the discordant measures. This is equivalent 
to the statement that errors (or differences from the truth) of equal 
amount in excess or defect are equally likely to occur. In the class of 
measures referred to by Mr. Galton, which are not of the nature of 
instrumental observations, reason is given for thinking that, on the 
whole, a better medium value is likely to be obtained by taking the 
Geometric Mean of the discrepant measures. This is equivalent to the 
assumption that measures are equally likely which bear to the truth 
ratios of equal amount in excess or defect, so to speak. For example, 
that measures which are respectively ('HHnOth and (xTnrr)th of the 
truth are equally likely to occur. This paper seeks to develop some 
consequences of this fundamental principle. 

The practical outcome is a method for the treatment of a series of 
measures which naturally group themselves round their Geometric 
Mean. This method may be briefly presented as follows :— 

Let the measures be 


'H, «... 02 n , 

Take the (hyperbolic) logarithm of each measure, thus forming a new 
series, say 

■ ■ ■ y,r, where y,~\og e x r . 

Form the Arithmetic Mean (A.M.) of the a?’s, so that 

w(A.M.) = 2a?. 

Form the Arithmetic Mean of the y* s: this will be the logarithm of 
the Geometric Mean (G.M.) of the aj’s, so that 

n log (G.M.) —^y. 

If then -—==4 (log (A.M.)—log (G.M.)), we have in h a “ measure of 

precision ” of the series of afs which we call the “ iveight .” 

The series of y’s may then be treated as a series grouped round its 
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Arithmetic Mean , and the formulae of the ordinary theory are at once 
applicable to it, the weight ascribed to it being h, as above defined. 

When the ^-series is so treated, we can at once pass from it to the 
^-series. Thus the probability, frequency, &c., of any x are the same 
as for the corresponding y : and any term interpolated in the y- series 
will be the logarithm of the corresponding term in the ir-series. 

1. If it be granted that the Geometric Mean of the measures is the 
most probable value of the quantity measured, it follows that measures 
which differ from the mean have a probability which becomes less 
as the discrepancy becomes greater, either in excess or defect. We 
naturally seek for some scale which shall define the respective proba¬ 
bility to be assigned to each measure of any given series. 

Thus, if x be one of the measures, and a be the Geometric Mean of the 
whole series, and if we can assign a form to the function 0, such that 

0^?) is the probability of x, the curve y—4> will afford us (gra¬ 
phically) the required scale, and 0^-^will be the “ Law of Frequency .” 

2. By a method analogous to the first method of Gauss (“ Theoria 
Motus,” ii, 3, 186), it can be shown that 

exp. (-ft* logs r^EEBe-'K 10 ^) 3 , 


where B and h are constants. B is determined from the condition 
that the sum of the probabilities of all the measures must be unity ; h 
depends on the grouping of the measures, being large when they agree 
closely and small when they are generally discordant; h is thus taken 
as a measure of the precision of the series in general, and is briefly 
called the “ weight ” of the series , or sometimes the “ weight 99 of the 
mean derived therefrom. (See fig. 1.) 


Fia. 1. 



2/ = B exp. ^ - A 2 log 2 -). 


3. In forming a scale of probability, 0 



, which shall be appli- 
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cable to any series of measures of which tbe mean and the weight are 
known, we mnst suppose the value of 0 to be given (either graphi¬ 
cally or by table) for all possible values of x. In other words, we 
mnst suppose the number of entries to be indefinitely great and 
indefinitely close together. It is clear that the actual probability of 
any individual measure becomes in such a series infinitesimal, though 
the relative magnitudes of any pair of probabilities remain unaltered. 
This is indicated in the formula of §2 by the constant B becoming 
infinitesimal. To avoid this inconvenience we are led to modify our 
original problem, and ask, not “ what is the probability of a measure 
x ? ” but “what is the probability of a measure lying between the 
very close limits a? and x -f Sx ? 99 

4. From the principle indicated above, viz., that on the whole there 
are as many measures less than the mean as there are greater than 
the mean, we deduce that the required probability is 


h ( 


W log 3 - 



the constant factor h-i- v tt results from the condition that the 
sum of all the values of this function from x=0 to x=co must be 
unity. And it will be found that 



dx ___ 


x 


as should be the case. 


The coefficient of Sx, viz .:—h exp. log 3 - ir . x, I have 

ventured to distinguish from the previous function by calling it the 
“ Laiv of Facility .” This function may be defined in words as “the 
ratio which the chance that a measure lies within a given small 
interval bears to the magnitude of the interval. 

5. Tracing the curve which represents the “ Law of Facility/ ’ we 
find that its form resembles that of the curve of frequency, but that 

the maximum ordinate occurs at x=a exp. in the former, and 


of course at x=a in the latter. The latter result implies that the 
mean is the most probable value : the former means that there is more 
chance of a measure lying within a given small fraction (Sx) of 


“ “ p (~i) 


than within the same small fraction of any other value 


that can be named. 

6. We may graphically represent our results in three ways 
1°. Trace the curve y^/ tt-Ii exp. (—7z, 3 log 3 af), (the mean being" 
taken as unity). Let the space between the curve and Ox be the boundary 
of a lamina of varying density. Take the density along the ordinate 
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at x to be 


1 

x 


Then tbe ordinate at x is proportional to the (in¬ 


finitesimal) probability of x, and the mass of the section bounded by 
the ordinates at x lf x 2 measures the probability of a measure lying 
between aq and x 2 . If we are dealing with a very large number of 
measures, this mass will be found to represent the number of measures 
which, actually lie between those limits. The mass of the lamina is 
bisected by the ordinate at aj=l (fig. 2). 


Fig. 2. 



y f 7T — £exp. (—A 2 log 2 #). 

density, at x — i. 

x 


2°. Regard the equation yfir—li exp. (— 1$ log 3 x) as that of a 
cylinder. Let it be cut by another cylinder at right angles to 

it. The solid included between these surfaces and the planes xz, xy r 
obviously corresponds to the lamina of 1°, volume being read for mass 
(fig. 8). Methods 1° and 2° thus represent in one figure both the fre¬ 
quency and the facility functions. 
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3°. The law of facility alone will be represented by the curve 
xyS 7r=A,exp. ( — hPlog^x). 

Here the area included between any two ordinates, the curve, and the 
axis of x , measures the probability of a measure lying between the 
corresponding abscissae The area of the curve is bisected at x~l^ 

but the maximum ordinate is at a?=exp. (-i) (fig- 4 )- 

Fig. 4. 



! xy v t r — h exp. ( — Ji 2 log 2 oc ). 

° A=eXp ' ("si)- 

AB = ——exp. f — Y 

Si w 


7. (Quetelet’s method.) Suppose the measures we are considering 
to be a series of estimates of the magnitude of some single object 
(such as the depth of a given tint). In forming such an estimate the 
mind is acted on by many small causes tending to make us err either 
in excess or defect. Suppose each small cause acts in such a way as 
to make us under- or over-estimate in a fixed ratio; aild as a particular 
case let this fixed ratio be the same for all the small causes. If, then, 
half the causes tend to over- and half to under-estimation, we 
estimate rightly. Our estimate in any given case depends on the par¬ 
ticular combination of the causes which has (so far as our knowledge 
ds concerned) fortuitously arisen. Let, then, the number of causes be 
2 n, the fixed ratio a. The total number of possible combinations will 
be 2 %l . The resulting possible ratios are 




a, 2w+3 , oT^ n . 


The chance of a correct estimate (ratio 1) is by the theory of com¬ 
binations |2^-r-2 2,i |n | n : the chance of an estimate bearing a ratio 
M- r to the truth will be |2^-f-2 w | n—r |w + r. Let now n become 
very large, and « very near to unity; while r remains finite. Then, 
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by a known approximation, the chance ultimately takes the form 

1 / r 2 \ 

• - exp. I ——I. If we are asked the chance of a ratio x , we put 

V H7T " \ n / 

x=.a r , and thus find log x—r log a ; the chance required is then 



corresponding to the law of frequency. 

Next determining the chance that the estimate shall lie between 
narrow limits x and x + x\ we find that we have to multiply the above 
expression by x 1 ~rx\o g«. Thus, if we take Zr" 3 =w(loga) 2 , the ex¬ 
pression agrees exactly with that already found for the law of facility. 
This verification is interesting, for though it depends on certain 
special suppositions, the process seems to throw light on the genesis of 
the law, and the significance of the modulus h. 

8. Another method of exhibiting the law, suggested by Mr. Galton’s 
Method of Intercomparison, is the following. Let the series of 
measures be represented by a series of ordinates: arrange these side 
by side at equal small distances and in order of magnitude. Their 
extremities will then lie on a curve of contrary flexure, which Mr. 
Galton calls an Ogive ; we may speak of it as the “ curve of distribu - 
tion .” Its equation can be shown to be 

f A log 7/ 

hhx— I e * 2 dt~e rf. (Ji log y), 

h being a constant depending on the interval between the ordinates as 
they stand (fig. 5). 


Lia. 5. 



hkx = erf. (7* log?/). 
OA=OA'= 

OB = OB' = iOA. 
BO = B'C'=iOA. 
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9. We may next show that if the Arithmetic Mean (A.M.) of all the 
measures be formed then 



This follows strictly when the number of measures is indefinitely 
great. It still gives a good approximation to the value of h when the 
number is considerable. 

10, Among the measures greater than the mean there is one which 
may be called middlemost: i.e ., such that it is an even wager that a 
measure (greater than the mean) lies above it or below it. A similar 
middlemost measure exists among those that are less than the mean. 
As these two measures, with the mean, divide the curve of facility 
into four equal parts, I propose to call them the “ higher quartile ” 
and the a loiver quartile ” respectively. It will be seen that they 
correspond to the ill-named “probable errors” of the ordinary theory. 
If Q, q be the quartiles we can show that 

Mog-^-=*4769 . . . = — h log —, 
a a 

so that Qq—a 3 . Thus also 

Q 7 '(-6208 . . )=a*=0*( 1*611 . . ). 

Similarly between zero and lower quartile we place a mid-measure 
which we call the “ loiver octile The “ higher oetile ” will subdivide 
the interval between the higher quartile and infinity. If 0, o, be the 
octiles, we have— 

O h (-443 )=a h =o k (2*255 . . ) : 
as before Oo~as: and 



It will be observed that in the curve of distribution Q and q are 
ordinates equidistant from the mean and the terminal ordinates. 0 is 
equidistant from Q and the asymptote. 

The analogues suggested by the mean error, and mean square 
error, &c., of the ordinal theory have no very practical value or 
significance for us. It will be remembered that they are introduced 
to obviate the difficulties arising from negative errors. In our 
problem these somewhat artificial functions have no special place. # 


* The mean-square-measure is a 



greater than mean is a 


/*564 . . \ 
exp. (—r--) 


The geometric mean of the measures 
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The gauges of the series given by the weight and the quartiles and 
octiles will generally suffice. 

11. Let two fallible measures be given whose respective weights 
are known, and let their product be formed. This product will be a 
fallible measure like its factors. It can be shown that the product is 
subject to a law of facility of the form already obtained, and further 
that, as in the ordinary theory, the weight is such that the square of 
its reciprocal is equal to the sum of the squared reciprocals of the 
weights of the factors. More generally, if x x , x% . . . be a series of 

measures whose weights are 7q, h% . . . , and if z be connected 

with them by the equation 


z—x { 1 . . . ., 

Then h the weight of 2 is given by 

t=!h?-W+ 
w v V ” ” 

These results are strict; that which follows is closely approximate 
and applies to series whose weight is not small. 

Let / (cq, a 2 . . . ) be any function of the fallible measures 
ni, a 3 . . . , each of which may be the mean of some considerable 
number of measures, so that the weights are respectively hi, h 2 . . 

Required the weight to be assigned to the value of the function 
obtained from these fallible values of its variables. We are able to 
show that if hf be the weight in question 



12. It only remains to establish the practical method presented in 
§ 1. We have 


Chance of a logarithm 1 _ I Chance of a measure 


lying between y and y -f By J 1 lying between x and x + Bx 
__ h 

V 7T 


/ x\ Bx _ h - 3 

I exp .y—Jfi log 3 exp. (-%-loga ) 


my. 


This is the expression of the ordinary law of grouping of the ?/’s 
round their arithmetic mean (log a). 

It follows that, for example, the number of re’s between x r and x s 
must be proportional to 

-4- P' exp. ( — Wif~)dy: 

V7T Jyr 

this can be readily evaluated by means of the ordinary tables of the 
error-function. 

Again, still using h for the weight, the “ probable-error ” of the 
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ordinary theory 

h (probable error) = *4769 . . . 

. *. —h (log Q —log a) 

. *. log a-\- prob. error =log Q. 

In words—the term which differs from the Arithmetic Mean by 
probable error in the ^/-series is the log. of the qnartile in the a-series. 

We infer that no new tables are necessary for the practical working 
of our method. We require only the tables of the error-function and 
of hyperbolic logarithms. Of course common logarithms may be 
used, if we remember to introduce the appropriate modulus into our 
formulae. 

In conclusion, I desire to acknowledge my indebtedness to Mr. 
Galton, not only for the suggestion of the problem I have here 
attempted to solve, but also for many valuable practical hints in the 
working. 


XIV. 

(Received September 19, 1879.) 

Col aba, Bombay, 

29th August, 1879. 

Dear Sir, 

I regret that in my paper on the “ Diurnal Variations of the Wind 
and Barometric Pressure,” published in vol. xxv, p. 402, of the “ Pro¬ 
ceedings of the Royal Society,” I have inadvertently referred the 
Bermuda wind observations to civil time instead of to astronomical 
time. 

The correction of this error shows that the variation of the values 
of the hourly inequalities of the wind components at Bermuda is very 
much less during the hours when the sun is above the horizon than 
during those when he is below it, especially in the winter half year. 

The calculated barometric variations, depending on the wind varia¬ 
tions, should also be referred to astronomical instead of to civil time; 
but the general reasoning of the paper remains the same, leaving the 
principal portions of the variations (viz., those which have half a day 
for their period) unaffected. 

(Signed) F. CHAMBERS. 


The Secretary of the Royal Society. 
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Fig. 3 
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Fig. 5. 














